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NOVEL TUMOR SUPPRESSOR GENE AND COMPOSITIONS AND METHODS FOR 

MAKING AND USING THE SAME 

[OOOIJ This invention was made with Government support under Program Project 
Grant P01CA76259, P01CA81534, and P30CA56036 from the National Cancer Institute. The 
Government has certain rights in this invention. 

CROSS REFERENCE TO RELATED APPLICATIONS 

[0002] This application claims the benefit of U.S. Provisional Application No. 
60/417,842, filed October 11, 2002, incorporated herein by reference in its enturety. 

FIELD OF THE INVENTION 

[0003] The invention relates to the identification and cloning of ARTS 1 , a novel tumor 
suppressor gene, and to methods of making and using the same. ARTSl was originally 
designated AKLTSl, and may also be referred to as ARLSl . 

BACKGROUND OF THE INVENTION 

[0004] Loss-of-fimction mutations in tumor suppressor genes (TSGs) play an essential 
role in the initiation and progression of human tumors, while inactivation by me&ylation seems 
to be important for tumor progression (Weinberg, R.A. Tumor suppressor genes. Science 254, 
1 138-46. (1991), which is incorporated herein by reference). Chromosome 13 at band ql4, 
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where the retmoblastoma (RBI) gene (Marshall, CJ. Tumor suppressor genes. Cell 64, 313-26. 
(1991), which is incorporated herein by reference) is located, is hemizygously or homozygously 
deleted in a diversity of hematopoietic and sohd tumors (Bulhich, F. & Croce, CM. Molecular 
biology of chronic lymphocytic leukemia. In Chronic Lymphoid Leukemias (ed. Chenson, B.D,) 
9-32 (Marcel Dekker, Inc., New York Bassel, 2001), which is incorporated herein by reference). 
Several reports presented evidence for a new tumor suppressor locus telomeric to the RBI gene 
(Brown, A.G., Ross, P.M., Dunne, E.M., Steel, CM. & Weir-Thompson, E.M. Evidence for a 
new tumom: siippressor locus (DBM) in human B-cell neoplasia telomeric to the retinoblastoma 
gene. Nat Genet 3, 67-72. (1993), Howthom, L.A., Chapman, R., Oscier, D. & Cowell, JJL 
The consistent 13ql4 translocation breakpoint seen in chronic B-cell leukaemia (BCLL) involves 
deletion of the D13S25 locus which lies distal to the retinoblastoma predisposition gene. 
Oncogene 8., 1415-9 (1993) and Liu, Y. et al. Chronic lymphocytic leukemia cells with allelic 
deletion at 13ql4 commonly have one intact RBI gene: evidence for a role of an adjacent locus. 
Proc Natl Acad Sci U S A 90, 8697-701. (1993) which are each incorporated herein by 
reference). However, none of the genes in the region were foxmd to be inactivated by either 
combination of deletion, mutations or promoter hypermethylation. 

[0005] There is a need to identify and clone TSGs whose loss of function are associated 
with initiation and progression of human tumors. There is a need to identify a TSG telomeric to 
the RBI gene. There is a need to identify nucleic acids which can serve as probes or primers for 
the detection of the TSG. There is a need for genetic based therapeutics which can be delivered 
to function m cells with a TSG mutation. There is a need for isolated protein and for antibodies 
which specifically react to the protein. There is a need for assays, reagents and kits to identify 
compounds that can upregulate, enhance or compensate for inactivity of the TSG. There is a 
need to study and understand the mechanisms by which the TSG is involved in initiation and 
progression of tumors and for reagents useful in such studies. There is a need to identify new 
cancer therapeutics and for kits and methods of identifying such compounds. 

SUMMARY OF THE INVENTION 

[0006] The invention relates to isolated proteins comprising the amino acid sequence 
shown in SEQ ID NO:2, 
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[0007] The invention relates to isolated nucleic acid molecules that comprise nucleic 
acid sequences that encode a protein that has an amino acid sequence shown in SEQ ID NO:2. 

[0008] The invention relates to isolated nucleic acid molecules that comprise SEQ ID 
NO:l or a fragment thereof having at least 10 nucleotides. 

[0009] The invention relates to a recombinant expression vector comprising the nucleic 
acid molecule comprising SEQ ID NO:l. 

[0010] The invention relates to a host cell comprising a recombinant expression vector 
comprising the nucleic acid molecule that comprises SEQ ID NO:L 

[0011] The invention relates to an oligonucleotide molecule comprising a nucleotide 
sequence complimentary to a nucleotide sequence of at least 5 nucleotides of SEQ ID NO:l. 

[0012] The invention relates to isolated antibodies that bind to an epitope on SEQ ID 

NO:2. 

[0013] The invention relates to methods of identifying modulators of Caspase-1 
protease activity. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0014] Figure 1. Localization of ARTSl tumor suppressor gene at 13ql4. A. The 
position of genetic markers and positions of genes on the map is shown. B. A multiple 
alignment of human ARTS 1 (SEQ ID NO:2) with human ARL proteins. Several motifs 
presumably involved in nucleotide binding and hydrolysis (PMl, PM3, G2 and G3), 
characteristic of Ras-related GTPases27, are also present in ARTSl. Furthermore, five 
additional aminoacids typical of the ARF subfamily (G2, N47,W74, R95 and G161) are all 
conserved in ARTSl. In the C-terminus ARTSl harbors less argmine or lysine residues than 
ARL4, ARL6 and ARL7. The location of Trpl49Stop mutation is indicated by an arrow. 

[0015] Figure 2. ARTSl mRNA expression and methylation analysis. A. Expression 
of ARTSl by Northern blotting in cancer cell lines show absent or reduced expression in several 
cell lines. B. ARLTSGl expression correlates with the level of methylation of this locus 
analyzed by Southern blotting of digested genomic DNA with Bgin alone or in combination with 
Hpall. The combination BglE+MspI was used to determine the fragment length without respect 
of methylation. The presence or absence of ARTS 1 expression is shown by or 
respectively and the restriction map (Bglll-B-thick vertical lines, Hpall-thin vertical lines) is 
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drawn at the bottom. The position oftheORF probe used is indicated by *. C. Correlation 
betweai ARTSl expression analyzed by RT-PCR and CpG sites methylation analyzed by 
bisulfite sequencing in firesh tumors; white and black rectangles represent unmefhylated and 
hypermethylated CpGs respectively, while gray rectangles represent partially methylated CpG 
sites. As control we use Epstein-Barr Virus transformed lymphoblastoid cell lines. 

[0016] Figure 3. ARTSl suppresses tumorigenicity and A549 cells. A. Restoration of 
ARTS 1 expression by transfection of the minigene into A549, B. Tumor formation in nude 
mice. The weight (mg) of tumors for the five analyzed clones determined at the indicated times 
are shown. The same results were obtained by measurement of tumors. C. Example of 
tumorigenesis in nude mice at 8 weeks after s.c. injection of 10 6 cells. C. Colony growth in 
soft agar (data at 21 days after platuig 5x10 4 cells). 

[0017] Figure 4. Analysis of ARTS 1 expression in human tissues by Northern blotting 
reveals that ARTSl is ubiquitously expressed. 

[0018] Figure 5. Mutation analysis in ARTSl shows the presence of the germline 
polymorphism G446A (Trpl49Stop). The presented sequences are in reverse orientation. For 
identification of the G446A (Trpl49Stop) mutation a r^id assay was developed using the Mael 
site introduced by the mutation. DNA was amplified using primers Mael-Fl (which contains a 
changed base Scorn the wild-type sequence to destroy a constitutive Mael site) and Mael-Rl (for 
sequences of the primers, see Table 4), purified using QIAquick PCR purification kit (QIAGEN) 
and digested with 2U of Mael (Boehringer Manhiem, Geraiany). The amplification of a normal 
allele gives rise to a single 138bp product, while the mutant allele produces two bands of 106 and 
32bp. Note that the digestion has low efficiency and only partial digestion products were 
obtained. Digested PCR products were loaded on 3% agarose gel and visualized using a UV 
imager. N=normal and T=inmor. 

[0019] Figure 6, Both wild-type ARTSl and the truncated ACARLTSl proteins are 
localized in cytoplasm and nucleus. Subcellular localization using ARTSl-GFP fiision protein 
293 cells were transfected with pARLTSl-gQ), pAC-ARLTSl-g§> and control plasmid, 
pEGFPNl. Brigiht field (right) and fluorescence images (left) of the same microscopy field are 
presented. 

10020] Figure 7. The sequence of the cDNA of human ARTSl (SEQ ID NO:l) is 
shown. The GenBank Accession number for the sequence is AF441378- 
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DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

[0021] The invention arises from the identification of ARTSl, a novel member of the 
ADP-ribosylation factor family. ARTSl is located at 13ql4, and displays features characteristic 
of a TSG. ARTSl is downregulated by hyperaiethylation in 25 out of 75 (33%) human primary 
tumors and cell lines analyzed. Furthermore, analysis of 800 tumor and normal DNAs revealed 
the presence of several variants including a germline nonsense polymorphism G446A 
(Trpl49Stop) that is three times more frequent in cancer patients with a family history of cancer 
than in the normal population. Restoration of wild-type ARTSl expression in A549 cells, which 
shows low levels of expression, suppresses tumor formation of A549 cancer cells. 

[0022] The GenBank accession number of the human ARTS 1 cDNA is AF44 1 378. 
During the final stages of the functional studies described below, a clone of L6kb, BC013150, 
containing the ORF of ARTSl and encoding the hypothetical protein FLJ22595 (accession 
number AAH13150) was deposited in the GenBank. 

[0023] The ARTSl gene, and proteins, polypeptides, or peptides encoded by the gene, 
can be used in methods of preventing abnormal cell growth in mammalian subjects. Such 
methods involve administering to a mammal a composition comprising an effective amount of 
liie ARTSl protein. Such methods also involve administering to a manunal a composition 
comprising an expression vector comprising a gene encoding ARTSl. 

[0024] The discovery of ARTSl provides the means to study its fimction as a TSG, to 
design probes and primers to detect its presence and/or to detect mutante, to prepare isolated 
nucleic acid molecules, to insert the nucleic acid molecules that encode ARTSl into vectors such 
as cloning vectors to produce multiple copies, expression vectors usefiil to transform cells that 
will produce the protein and gene therapy vectors which can be used treat patients with tumors 
arising from a lack of endogenous ARTSl function. Antisense compounds may be produced to 
generate tumor cells that lack ARTSl function that can be used in assays to identify compounds 
useful to treat such cancers. Assays and kits can also be provided to identify compounds that 
upregulate or enhance ARTSl activity. Transformed host cells may be used in methods to 
produce ARTSl proteiu. Antibodies can be prepared that specifically bind to ARTSl protein 
and used to isolate or detect the protein including to distinguish wild type from mutants. 

[0025] In certain embodiments, the present invention provides isolated ARTSl protein 
that comprises the amino acid sequence shown in SEQ ID NO:2. Hie ARTSl protein can be 
isolated from natural sources, produced by recombinant DNA methods or synthesized by 
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standard protein synthesis techniques. In other embodiments, the invention relates to ARTSl- 
like polypeptides, which are polypeptides that are similar to, but differ from, the ARTSl 
polypeptide by having at least one amino acid substitution or deletion. For example, 
conservative amino acid substitutions may be made at one or more noness^tial amino acid 
residues of the ARTSl protein to generate ARTSl-like polypeptides. A "nonessential" amino 
acid residue is a residue that can be altered from the wild-type sequence of ARTSl protein (e.g., 
the sequence of SEQ ID NO:2) without altering the biological activity, whereas an "essential" 
amino acid residue is required for biological activity. A "conservative amino acid substitution" is 
one in which the amino acid residue is replaced with an amino acid residue having a similar side 
chain. Families of amino acid residues having similar side chains have been defined in the art. 
These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic 
side chains (e.g., aspartic acid, glutamic acid), imcharged polar side chains (e.g., glycine, 
asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side 
chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine). Such substitutions would not be made for conserved amino acid residues, 
or for amino acid residues residing within a conserved motif. 

[0026] Antibodies which specifically bind to the ARTSl protein may be used to purify 
the protein from natural sources using well known techniques and readily available starting 
materials. Such antibodies may also be used to purify ARTSl from material present when 
producing the protein by recombinant DNA methodology. As used herein, the term "antibody" 
is meant to refer to complete, intact antibodies, and fragments including Fab fragments and 
F(ab)2 fragments. Complete, intact antibodies include monoclonal antibodies such as murine 
monoclonal antibodies, chimeric antibodies, primatized antibodes and humanized antibodies. 
Antibodies that bind to an epitope present on ARTSl are useful to isolate and purify ARTSl 
from both natural sources or recombinant expression systems using well known techniques such 
as affinity chromatography. Such antibodies are useful to detect the presence of such protein in a 
sample and to determine if cells are expressing the protein. 

[0027] The production of antibodies and the protein stractures of complete, intact 
antibodies, and fragments such as Fab fragments and F(ab)2 fragments and the organization of 
the genetic sequences that encode such molecules are well known and are described, for 
example, in Harlow, E. and D. Lane (1988) ANTIBODIES: A Laboratory Manual, Cold Spring 
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Harbor Laboratory, Cold Spring Harbor, NY. which is incorporated herein by reference. Briefly, 
for example, ARTSl or an immunogenic fragment thereof is injected into mice. The spleen of 
the mouse is removed, the spleen cells are isolated and fused with immortalized mouse cells. 
The hybrid cells, or hybridomas, are cultured and those cells that secrete antibodies are selected. 
The antibodies are analyzed and, if found to specifically bind to ARTSl, the hybridoma which 
produces them is cultured to produce a continuous supply of antibodies, 

[0028] According to some embodiments, the present invention relates to an isolated 
nucleic acid molecule comprising a nucleotide sequence that encodes the amino acid sequence of 
SEQ ID NO:2. Such molecules can be routinely designed using the information set forth in SEQ 
ID NO:2. In certain embodiments, the invention relates to an isolated nucleic acid molecule 
comprising SEQ ID NO:l. Nucleic acid molecules that are fragments of nucleic acid molecules 
comprising a nucleotide sequence that encode the amino acid sequence of SEQ ID NO:2 and of 
nucleic acid molecules comprising SEQ ID NO:l are also encompassed by the present invention. 
By "fragment" is intended a portion of the nucleotide sequence encoding the ARTSl protein or 
an ARTSl -like polypeptide. A fragment of an ARTSl nucleotide sequence may encode a 
biologically active portion of an ARTSl -like protein, or it may be a fragment that can be used as 
a hybridization probe or PGR primer. A biologically active portion of an ARTSl-like protein can 
be prepared by isolating a portion of one of the nucleotide sequences of the invention, expressing 
the encoded portion of the ARTSl-like protein (e.g., by recombinant expression in vitro), and 
assessing the activity of tiie encoded portion of the ARTSl-like protein. Nucleic acid molecules 
that are fragments of an ARTSl-like nucleotide sequence comprise at least about 10, 15, 20, 50, 
75, 100, 200, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 
.1100, 1150, 1200, 1250, 1300, 1350, 1400, 1450, 1500, 1550, 1600, 1650, 1700, 1750, 1800, 
1850, 1900, 1950 nucleotides, or up to the number of nucleotides present in a fiill-length 
ARTSl-like nucleotide sequence disclosed herein (for example, up to 3 791 nucleotides for SEQ 
ID NO:l), depending upon the intended use. Nucleic acid molecules that are variants of the 
ARTSl nucleotide sequences disclosed herein are also encompassed by the present invention. 
"Variants" of the ARTSl nucleotide sequences include those sequences that encode the ARTSl 
protein or ARTSl-like polypeptides disclosed herein but that differ conservatively because of the 
degeneracy of the genetic code. These naturally occurring allelic variants can be identified with 
the use of well-known molecular biology techniques, such as polymerase chain reaction (PGR) 
and hybridization techniques as outlined below. Variant nucleotide sequences also include 
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synthetically derived nucleotide sequences that have been generated, for example, by using site- 
directed mutagenesis but which still encode the ARTSl-hke proteins. Generally, nucleotide 
sequence variants of the invention will have at least about 45%, 55%, 65%, 70%, 75%, 80%, 
85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:l. 

[0029] Using standard techniques and readily available starting materials, a nucleic acid 
molecule that encodes ARTSl may be isolated from a cDNA library, using probes or primers 
which are designed using flie nucleotide sequence information disclosed in SEQ ID NO: 1 . In 
some embodiments, the nucleic acid molecules comprise the nucleotide sequence that consists of 
the coding sequence in SEQ ID NO:l. In some embodiments, the nucleic acid molecules consist 
of the nucleotide sequence set forth in SEQ ID NO:l. The isolated nucleic acid molecules of the 
invention are useful to prepare constructs and recombinant expression systems for preparing 
ARTSl. 

[0030] A cDNA library may be generated by well-known techniques. A cDNA clone 
which contains one of the nucleotide sequences set out is identified using probes that comprise at 
least a portion of the nucleotide sequence disclosed in SEQ ID NO: 1. The probes have at least 
16 nucleotides, preferably 24 nucleotides. The probes are used to screen the cDNA library using 
standard hybridization techniques. Alternatively, genomic clones may be isolated using genomic 
DNA from any human cell as a startmg material. In certain embodiments, the present invention 
relates to isolated nucleic acid molecules that comprise a nucleotide sequence identical or 
complementary to a fragment of SEQ ID NO: 1 which is at least 10 nucleotides. In some 
embodiments, the isolated nucleic acid molecules consist of a nucleotide sequence identical or 
complementary to a fragment of SEQ ID NO:l which is at least 10 nucleotides. In some 
embodiments, the isolated nucleic acid molecules comprise or consist of a nucleotide sequence 
identical or complementary to a fragment of SEQ ID NO:l which is 15-150 nucleotides. In 
some embodiments, the isolated nucleic acid molecules comprise or consist of a nucleotide 
sequence identical or complementary to a fragment of SEQ ID NO: 1 which is 15-30 nucleotides. 
Isolated nucleic acid molecules that comprise or consist of a nucleotide sequence identical or 
complementary to a fragment of SEQ ID NO:l which is at least 10 nucleotides are usefiU as 
probes for identifying genes and cDNA sequence having SEQ ID NO:l, PGR primers for 
ampUfying genes and cDNA having SEQ ID NO:l, and antisense molecules for inhibiting 
transcription and translation of genes and cDNA, respectively, which encode ARTSl having the 
amino acid sequence of SEQ ID NO:2, 
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[0031] The cDNA that encodes ARTSl may be used as a molecular marker in 
electrophoresis assays in which cDNA from a sample is separated on an electrophoresis gel and 
ARTSl probes are used to identify bands which hybridize to such probes. Specifically, SEQ ID 
NO: 1 or portions thereof^ may be used as a molecular marker in electrophoresis assays in which 
cDNA firom a sample is separated on an electrophoresis gel and ARTSl specific probes are used 
to identify bands which hybridize to them, indicating that the band has a nucleotide sequence 
complementary to the sequence of the probes. The isolated nucleic acid molecule provided as a 
size marker will show up as a positive band that is known to hybridize to the probes and thus can 
be used as a reference point to the size of cDNA that encodes ARTSl. Electrophoresis gels 
usefiil in such an assay include standard polyacrylamide gels as described in Sambrook et al.. 
Molecular Cloning a Laboratory Manual, Second Ed. Cold Spring Harbor Press (1989) which is 
incorporated herein by reference. 

[0032] The nucleotide sequences in SEQ ID N0:1 may be used to design probes, 
primers and complementary molecules which specifically hybridize to the imique nucleotide 
sequences of ARTSL Probes, primers and complementary molecules which specifically 
hybridize to nucleotide sequence that encodes ARTSl may be designed routinely by those 
having ordinary skill in the art. As used herein, the term "q)ecifically hybridize to nucleotide 
sequence that encodes ARTSl" is meant to refer to nucleic acid molecules with unique 
nucleotide sequences that hybridize to ARTSl encoding sequences but not other known proteia 
encoding sequences, such as sequences identical to portions of SEQ ID NO:l. This, ttie unique 
sequences described h^ein are those fliat do not ovCTlap with known sequences. 

[0033] The present invoition also includes labeled oligonucleotides that are usefiil as 
probes for performing oligonucleotide hybridization methods to identify ARTS 1 . The 
oligonucleotides include sequences that specifically hybridize to nucleotide sequences that 
encode ARTSL Accordingly, the present invention includes probes that can be labeled and 
hybridized to unique nucleotide sequences that encode ARTSL The labeled probes of the 
present invention are labeled with radiolabelled nucleotides or are otherwise detectable by 
readily available nonradioactive detection systems. In some preferred embodiments, probes 
comprise oligonucleotides consisting from 10 to 100 nucleotides. In some preferred 
embodiments, probes comprise oligonucleotides consisting of from 10 to 50 nucleotides. Jn 
some preferred embodiments, probes comprise oligonucleotides consisting of from 12 to 20 
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nucleotides. The probes preferably contain nucleotide sequence completely identical or 
complementary to a fragment of a unique nucleotide sequences of ARTS 1. 

[0034] PGR technology is practiced routinely by those having ordinary skill in the art 
and its uses in diagnostics are well known and accepted. Methods for practicing PGR technology 
are disclosed in 'TPCR Protocols: A Guide to Methods and Applications'*, Innis, M.A., et al. Eds. 
Academic Press, Inc. San Diego, CA (1990) which is incorporated herein by reference. 
Applications of PGR technology are disclosed in '^Polymerase Chain Reaction" Erlich, H.A., et 
al., Eds. Gold Spring Harbor Press, Gold Spring Harbor, NY (1989) which is incorporated herein 
by reference. Some simple rules aid in tiie design of efficient primers. Typical primers are IS- 
IS nucleotides in length having 50% to 60% g+c composition. The entire primer is preferably 
complementary to the sequence it must hybridize to. Preferably, primers generate PGR products 
having from 100 base pairs to 2000 base pairs. However, it is possible to generate products of 50 
base pairs to up to 10 kb and more. 

[0035] PGR technology allows for the rapid generation of multiple copies of nucleotide 
sequences by providing 5* and 3' primers that hybridize to sequences present in a nucleic acid 
molecule, and further providing free nucleotides and an enzyme which fills in the 
con[q)lementary bases to the nucleotide sequence between the primers with the free nucleotides to 
produce a complementary strand of DNA. The enzyme will fill in the complementary sequences 
adjacent to the primers. If both the 5* primer and 3' primer hybridize to nucleotide sequences on 
the complementary strands of the same fi:agment of nucleic acid, exponential amplification of a 
specific double-stranded product results. If only a single primer hybridizes to the nucleic acid 
molecule, linear amplification produces single-stranded products of variable length. PGR 
primers include at least one primer which includes a nucleotide sequence that specifically 
hybridizes to nucleotide sequence that encodes ARTSl. 

[0036] One havmg ordinary skill in the art can isolate the nucleic acid molecule that 
encode ARTSl and insert it into an expression vector using standard techniques and readily 
available starting materials. 

[0037] The present invention relates to a recombinant expression vector that comprises 
a nucleotide sequence that encodes ARTSl that comprises the amino acid sequence of SEQ ID 
NO:2. As used herein, the term "recombinant expression vector^' is meant to refer to a plasmid, 
phage, viral particle or other vector which, when introduced into an appropriate host, contains 
the necessary genetic elements to direct expression of the coding sequence that encodes the 
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ARTSl of the invention. The coding sequence is operably Unked to the necessary regulatory 
sequences. Expression vectors are well known and readily available. Examples of expression 
vectors include plasmids, phages, viral vectors and other nucleic acid molecules or nucleic acid 
molecule containing vehicles useful to transform host cells and facilitate expression of coding 
sequences. In some embodiments, the recombinant expression vector comprises the nucleotide 
sequence set forth in SEQ ID NO: 1. The recombinant expression vectors of the invention are 
useful for transforming hosts to prepare recombinant expression systems for preparing ARTSl . 

[0038] The present invention relates to a host cell that comprises the recombinant 
expression vector that includes a nucleotide sequence that encodes ARTSl that comprises SEQ 
ID NO:l. In some embodiments, the host cell comprises a recombinant expression vector that 
comprises SEQ ID NO: 1 . Host cells for use in well known recombinant expression systems for 
production of proteins are well known and readily available. Examples of host cells include 
bacteria cells such as E. coli, yeast cells such as S. cerevisiae, insect cells such as S. frugiperda, 
non-human mammalian tissue culture cells Chinese hamster ovary (CHO) cells and human tissue 
culture cells such as HeLa cells, 

[0039] The present invention relates to a transgenic non-human mammal that comprises 
the recombinant expression vector that comprises a nucleic acid sequence tiiat encodes ARTSl 
that comprises the amino acid sequence of SEQ ID NO:2. Transgenic non-human mammals 
usefiil to produce recombinant proteins are well known as are the expression vectors necessary 
and the techniques for generating transgenic animals. Generally, the transgenic animal 
comprises a recombinant expression vector ia which the nucleotide sequence that encodes 
ARTSl is operably linked to a mammary cell specific promoter whereby the coding sequence is 
only expressed in mammary cells and the recombinant protein so expressed is recovered firom the 
animal's milk. In some embodiments, the coding sequence that encodes ARTSl is SEQ ID 
NO:l. 

[0040] In some embodiments, for example, one having ordinary skill in the art can, 
using well known techniques, insert such DNA molecules into a commercially available 
expression vector for use in well known expression systems. For example, the commercially 
available plasmid pSE420 (Invitrogen, San Diego, CA) may be used for production of collagen 
in E. coli. The commercially available plasmid pYES2 (Invitrogen, San Diego, CA) may, for 
example, be used for production in S. cereAdsiae strains of yeast. The commercially available 
MAXBAC™ complete baculovirus expression systraa (Invitrogen, San Diego, CA) may, for 
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example, be used for production in insect cells. The commercially available plasmid pcDNA I 
(Invitrogen, San Diego, CA) may, for example, be used for production in mammalian cells such 
, as Chinese Hamster Ovary cells. One having ordinary skill in the art can use these commercial 
expression vectors and systems or others to produce Caspase-1 using routine techniques and 
readily available starting materials. (See e.g., Sambrook et al.. Molecular Cloning a Laboratory 
Manual, Second Ed. Cold Spring Harbor Press (1989) which is incorporated herein by 
reference.) Thus, the desired proteins can be prepared in both prokaryotic and eukaryotic 
systems, resulting in a spectrum of processed forms of the protein. 

[0041] One having ordinary skill in the art may use other commercially available 
CTqpression vectors and systems or produce vectors using well known methods and readily 
available starting materials. Expression systems containing the requisite control sequences, such 
as promoters and polyadenylation signals, and preferably enhancers, are readily available and 
known in the art for a variety of hosts. See e.g., Sambrook et al.. Molecular Cloning a 
Laboratory Manual, Second Ed. Cold Spring Harbor Press (1989). 

[0042] A wide variety of eukaryotic hosts are also now available for production of 
recombinant foreign proteias. As in bacteria, eukaryotic hosts may be transformed with 
e3q)ression systems which produce the desired protein directly, but more commonly signal 
sequences are provided to effect the secretion of the protein, Eukaryotic systems have the 
additional advantage that they are able to process introns which may occur in the genomic 
sequences encoding proteins of higher organisms. Eukaryotic systems also provide a variety of 
processing mechanisms which residt in, for example, glycosylation, carboxy-terminal amidation, 
oxidation or derivatization of certain amiao acid residues, conformational control, and so forth. 

[0043] Commonly used eukaryotic systems include, but are not limited to, yeast, fungal 
cells, insect cells, mammalian cells, avian cells, and cells of higher plants. Suitable promoters 
are available which are compatible and operable for use in each of these host types as well as are 
termination sequences and enhancers, e.g. the baculovirus polyhedron promoter. As above, 
promoters can be either constitutive or inducible. For example, in mammalian systems, the 
mouse metallothionein promoter can be induced by the addition of heavy metal ions. 

[0044] The particulars for the constmction of expression systems suitable for desired 
hosts are known to those in the art. ' Briefly, for recombinant production of the protein, the DNA 
encoding the polypeptide is suitably ligated into the expression vector of choice. The DNA is 
operably linked to all regulatory elements which are necessary for expression of the DNA in the 
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selected host. One having ordinary skill in the art can, using well known techniques, prepare 
expression vectors for recombinant production of the polypeptide. 

[0045] The expression vector including the DNA that encodes the Caspase-1 is used to 
transform the compatible host that is then cultured and maintained under conditions wherein 
expression of the foreign DNA takes place. The protein of the present invention thus produced is 
recovered from the culture, either by lysing the cells or from the culture medium as appropriate 
and known to those in the art. One having ordinary skill in the art can, using well known 
techniques, isolate ARTSl that is produced using such expression systems. The methods of 
piuifying ARTSl from natural sources using antibodies which specifically bind to ARTSl as 
descxribed above, may be equally applied to purifying ARTSl produced by recombinant DNA 
methodology. 

[0046] Examples of genetic constructs include ARTSl coding sequence operably 
linked to a promoter that is functional in the cell line into which the constructs are transfected. 
Examples of constitutive promoters include promoters from c)rtomegalovirus or SV40. 
Examples of inducible promoters include mouse mammary leukemia virus or metallothionein 
promoters. Those having ordinary skill in the art can readily produce genetic constructs useful 
for transfecting with cells with DNA that encodes ARTSl from readily available starting 
materials. Such gene constructs are useful for the production of ARTSl. 

[0047] In some embodiments of the invention, transgenic non-human animals are 
generated. The transgenic animals according to the invention contain SEQ ID N0:1 under the 
regulatory control of a mammary specific promoter. One having orclinary skill in the art using 
standard techniques, such as those taught in U.S. Patent No. 4,873,191 issued October 10, 1989 
to Wagner and U.S. Patent No. 4,736,866 issued April 12, 1988 to Leder, both of which are 
incorporated herein by reference, can produce transgenic animals which produce ARTS 1 . 
Preferred animals are rodents, particularly rats and mice, and goats, 

[0048] In addition to producing these proteins by recombinant techniques, automated 
peptide synthesizers may also be employed to produce ARTS 1 . Such techniques are well known 
to those having ordinary skill in the art and are usefiil if derivatives that have substitutions not 
provided for in DNA-encoded protein production. 

[0049] One aspect of the invention relates to gene therapy, specifically "gene 
replacement." Gene replacement" refers to the replacement of a mutated genetic element with a 
normal gene. The present invention provides methods of gene therapy that is a "gene 
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replacement" therapy. Generally the present gene replacement method involves inhibition of an 
abnormal ARTS-l product coupled with replacement with the normal ARTS-1 gene. Generally, 
methods of the present invention can be used to treat conditions associated with tumorogenesis 
related to a lack of or insufiScient amount of functional wild type ARTS-1. Methods of the 
present invention may be used to replace the abnormal ARTS-1 gene with a normal ARTS-1 
gene. 

[0050] By normal ARTS-1 gene is meant any gene which, when encoded produces a 
biologically active, wild-type tumor suppressing ARTS-1 protein. By abnormal or mutant gene 
is meant any gene which, when encoded, does not produce a biologically active, wild-type 
ARTS-1 protein and /or is insufficiently present to perform a tumor suppression function. 

[0051] The term "DNA construct" as used herein refers to any DNA molecule which 
has been modified such that the nucleotide sequences in the molecule are not identical to a 
sequence which is produced naturally. 

[0052] The term "expression vector", as used herein, is defined as a DNA construct 
which includes an autonomous site of replication, a site of transcription initiation, and at least 
one structural gene coding for a protein which is to be expressed in a host organism. The 
expression vector will usually also contain appropriate control regions such as a promoter and 
terminator which control the expression of the protein in the host organism. Expression vectors 
of the present invention may include retroviral vectors such as the "double copy*' vector. As one 
skilled in the art would recognize, the particular vector chosen depends partly upon the cell-type 
targeted. 

[0053] In preferred embodiments of the present invention the ejqiression vector 
includes a promoter. Vectors encoding one or more ribo2ymes should preferably utiUze a strong, 
RNA polymerase m type promoter. Useful promoters include, but are not limited to tRNA and 
SV40 promoters. Expression vectors of the present invention may also include homologous 
sequences with a host gene to provide for integration of the modified gene into the chromosome 
of the host. 

[0054] The term *T>ifunctional expression vector" as used herein is defined as an 
expression vector which contains at least one structural gene cassette coding for a protein which 
is to be expressed in a host organism and a regulatory cassette coding for a regulatory element. 
The regulatory cassette may code for any element which functions within the cell to inhibit flie 
expression of one or more genes. In accordance with preferred embodiments of the present 
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invention the regulatory cassette codes for an RNA fragment having ribozyme activity effective 
to cleave a separate RNA molecule. 

[0055] Cassette, as used herein, refers to a discrete DNA fragment that encodes a 
control region and a DNA sequence of interest such a structural protein. 

[0056] The term "plasmid" is used herein in accordance with its commonly accepted 
meaning, i.e. autonomously replicating, usually close looped, DNA. 

[0057] **Ribozyme" as the term is used herein, refers to an enzyme which is made of 
RNA, Ribozymes are involved in the cleavage and/or ligation of RNA chains. In preferred 
embodiments of tiie present invention, "hammerhead ribozymes" are used. As described above, 
hammerhead ribozymes cleave the phosphodiester bond of a target RNA downstream of a GUX 
triplet where X can be C, U, or A. Hammerhead ribozymes used in methods of the present 
invention have a structural domain having the sequence 3'- 

CAAAGCAGGAGCGCCUGAGUAGUC-5' (SEQ ID NO:3) . Site specific regulatory elements 
such as site specific ribozymes are provided in accordance with the present invention. The 
ribozyme regulatory element is made site specific, having the sequence 3'-Xn- 
CAAAGCAGGAGCGCCUGAGUAGUC-Ym-5' ((SEQ ID NO:4), reported in 5' to 3' 
direction) where X and Y are complementary to regions of the target mRNA flanking the GUC 
site and n-hn are generally from about 20 to about 35 RNA bases in length, n-hn need not be of 
equal lengths although it is preferable that neither n nor m is less than about 10. 

[0058] Hammerhead ribozymes target the triplet GUC. For a gene of interest a target 
site can be identified by analyzing the gene sequence to identify GUC triplets. Computer 
analysis of secondary structure may assist in site selection. Demnan, (1993), Biotechniques, 15, 
1090-1094. 

[0059] Vectors of the present invention may be delivered to a patient via methods 
known in the art. Retroviral mediated delivery is particularly preferred in some embodiments of 
the invention. In vivo delivery by of retroviral vectors may be achieved, for example by i.v. 
injection of the retroviral vectors. A double balloon catheter may also be used for direct delivery 
of retroviral vectors to the patient. 

[0060] According to one aspect of the invention, compounds may be screened to 
identify compounds that inhibit or enhance Caspase-1 activity. Substrates of Caspase-1 include 
baculovims protein p35 and the Sf immimophillin FiaBP46. Assays may be performed 
combining Caspase-1 with a substrate in the presence or absence of a test compound. The level 
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of Caspase-1 activity in the presence of the test compound is compared to the level in the 
absence of the test compound. If Caspase-1 activity is increased by the presence of the test 
compound, the test compound is an enhancer. If Caspase-1 activity is decreased by the presence 
of the test compound, the test compound is an inhibitor. In some embodiments of the invention, 
the preferred concentration of test compound is from IjiM to SOOjiM* A preferred concentration 
is from 10M.M to 100(xM. In some preferred embodiments, it is desirable to use a series of 
dilutions of test compounds. 

[0061] Kits are included which comprise containers with reagents necessary to screen 
test compounds. Such kits include a container with Caspase-1 protein, a container with a 
substrate such as FKBP46 or p35, which is preferably a labeled substrate, and instructions for 
performing ftie assay. Kits may include a control inhibitor such as anti-Caspase-1 neutralizing 
antibodies. 

[0062] Combinatorial libraries may be screened to identify compounds that enhance or 
iohibit Caspase-1 actiAdty. 

EXAMPLES 

[0063] EXOFISH (Roest CroUius, et al. Estimate of human gene number provided by 
genome-wide aivalysis using Tetaodon nigroviridis DNA s^equence. Nat Genet 25, 235-8. (2000), 
which is incorporated herein by reference) was used to scan 1 .4 Mb of assembled genomic 
sequence at chromosome 13ql4 OVfabuchi, H. et al. Cloning and characterization of CLLD6, 
CLLD7, and CLLD8, novel candidate genes for leukemogenesis at chromosome 13ql4, a region 
commonly deleted B-cell chronic lymphocytic leukemia. Cancer Res 61, 2870-7. (2001), 
BiiUrich, F. et al. Characterization of the 13ql4 tumor suppressor locus in CLL: identification of 
ALTl, an altemative splice variant of the LEU2 gene. Cancer Res61. 6640-8, (2001), Lander, 
E.S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860-921. (2001), 
and Venter, J.C. et al. The sequence of the human genome. Science 291, 1304-51. (2001) which 
are each incorporated herein by reference) for putative genes. A 182 bp *ecore' (evoluntionary 
conserved region) coding for an aminoacidic sequence with high homology to several members 
of the ADP-ribosylation factor family was found. By using EST walking and RACE, the 
corresponding full-length cDNA was obtained. Comparison with the genomic sequence 
indicated that the cloned cDNA, which was designated ARTSl (for ADP-Ribosylation factor- 
Like, putative Tumor Suppressor gene 1), derives from a small gene composed of two exons 
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separated by a 1 .8 kb intronic sequence and spanning about 6 kb of DNA. Using LOH analysis it 
was found that this region was heterozygously deleted in a fraction of tumors between 10% 
(colon cancers) and 20% (B-CLL). The putative ORF, within the second exon, encodes a 196- 
amino acid protein with a predicted molecular mass of 21 kDa. BLAST analysis and Conserved 
Domain search of protein databases revealed highly significant homology with the ADP- 
ribosylation factor (ARF) and ARF-like (ARL) protein subfamily of the ras family (Moss, J, & 
Vaughan, M. Molecules in the ARF orbit J Biol Chem 273, 21431-4. (1998), and Kahn, R.A., 
Der, CJ. 8l Bokoch, G,M. The ras superfamily of GTP-binding proteins: guidelines of 
nomenclature. Faseb J 6, 2512-3 (1992), which are each incorporated herein by reference). At 
the protein level, related proteins share at most 45% identical amino acids. A multiple alignment 
with the CLUSTALW program indicates that ARTSl belongs to the subgroup formed by ARL4 , 
ARL6 and ARL7 (Jacobs, S. et 1. ADP-ribosylation factor (ARF)-like 4, 6, and 7 represent a 
subgroup of the ARF family characterized by rapid nucleotide exchange and nuclear localization 
singnal. FEES Lett 456, 384-8.(1999)) (Figure 1). 

[0064] Northem analysis of normal human tissues with an ARTSl probe revealed 
ubiquitous expression of a 2.2 kb transcript. In some tissues, two additional minor bands of 
approximately 1.3 and 5,5 kb were detected resulting from the use of different polyadenylation 
sites (Figure 4). The expression of ARTS 1 was analyzed by Northem blot and/or 
semiquantitative RT-PCR in a set of 59 hematopoietic and solid tumor cell lines. ARTSl 
expression was significantly reduced or absent in 22% (7/32) of blood cancer cell lines, 78% 
(7/9) of lung cancer cell lines, 33% (2/6) of esophageal cancer cell lines and 22% of pancreatic 
cancer cell lines as well as in HeLa S3 (cervical carcinoma), SW 480 (colorectal cancer) and G- 
361 (melanoma) cell lines. In addition, 4 out of 16 fresh tumor samples (25%, 2/7 Ixmg 
carcinomas and 2/9 B-CLL) for which cDNA and/or RNA were available showed reduction or 
absence of ARTSl expression when compared to their normal tissue coimterparts (Figure 2 and 
Table 5). 

[0065] The possibility that, as occurs with other cancer-related genes such as TSCLl 
(Kuramochi, M. et al. TSLCl is a tumor-suppressor gene in human non-small-cell lung cancer. 
Nat Genet 27, 427-30. (2001) which is incorporated herein by reference) or pl6 (Merlo, A. et al. 
5' CpG island methylation is associated with transcriptional silencing of the tumour suppressor 
pl6/CDKN2/MTSl in human cancers. Nat Med 1, 686-92. (1995) which is incorporated herein 
by reference), ARTSl is downregulated through hypermethylation of the putative promoter was 
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examined. First, the global methylation level around ARTSl was analyzed by Southern blotting 
using cell lines for which expression data was available. The level of expression is coirelated 
with the methylation status of the genomic region - cell lines with low or no ARTSl expression 
are highly methylated, while cell lines with normal levels of e3q)ression display only one 
methylated site (Figure 2). ARTSl DNA methylation patterns were examined in more detail 
through bisulfite sequencing to determine the methylation status of 5 CpG sites near the putative 
promoter sequences. Fresh tumor samples and tumor cell lines with low or absent ARTSl 
expression showed higher methylation levels than normal tissues or tumors with normal 
expression levels (Figure 2 and Table 5). 

[0066] During an initial mutation screening with 80 cell lines (including 70 used for 
gene expression and 10 melanoma cell lines), three mutations have been identified. The first, a 
missense mutation G446A (Trpl49Stop) is present in homozygosity in the MCF7 breast cell line 
and in heterozygosity in the HS776T pancreatic carcinoma cell Une. Two heterozygous 
substitutions were identified in melanoma cell lines: a T50C (Metl7Thr) and a C262A 
0-eu88Met) in one of the patients with T50C (Table 1). 

[0067] In order to establish the significance of these mutations three panels of samples 
were screened (Methods and Table 6). The first includes 216 human tumors that were screened 
by direct sequencing of the ARTSl ORF. Eight cases carried the G446A (Trpl49Stop) 
mutation, including 3 breast cancers (3/48 of cases, 6.25%), 2 colorectal carcinomas (2/58, 
3.45%), 1 lung carcinoma (1/5, 20%), 1 fliyroid tumor (1/65, 1,5%) and one idiopathyc 
pancytopenia. All tumor samples had both the wild-type and mutant alleles except for a breast 
tumor with LOH at the ARTSl locus, which was homozygous for the mutation. Sequencing of 
the ARTSl in paired normal tissues, which were available for three out of six tumors, revealed 
the same alteration in the germline of patients. 

[0068] The second panel contains 109 blood DNAs from patients with multiple cancers 
or with a family history of cancer screened by direct sequencing. Six additional cases with the 
G446A (Trpl49Stop) were identified - 2 malignant melanomas + prostate carcinoma cases (2/17, 
1 1.75%), 2 cases of familial CLL (2/17, 1 1.75%), 1 case of pancreatic + melanoma (1/6, 16.5%) 
and 1 breast cancer (1/69 of cases, 1 .5%) (see Table 2 for family history). At the protein level, 
the stop codon inserts a premature temiination 48 amino acids before the C-terminus, leading to 
the synthesis of a smaller protein with 148 mstead of 196 amino acids (Fig. 1). Thus, the 
truncated protein lacks the C-terminus motif presumably involved in nucleotide binding and 

-18- 



wo 2004/033659 



PCT/US2003/032270 



hydrolysis characteristic of Ras related GTPases, one of the five additional amino acids typical 
of the ARF subfamily (Glyl61) and the putative nuclear localization signal. Furthermore, 
Tip 149, the site of the mutation is conserved in AKL4 and in 1 1 other ARF or ARF related genes 
including all six ARF genes. 

[0069] The third panel comprises the case-controls: allele frequency for the G446A 
(Trpl49Stop) mutation in three separate Caucasian cohorts was 2.10%, with variations between 
0.86% (1/1 16) in the US population and 3.44% (7/203) in the Italian population. Overall, 14 
patients out of 325 analyzed (4.63%) and 10 out of 475 normal controls (2.1%) had the stop 
mutation. The odds of G446A (Trpl49Stop) were 2.10 (95% CI 0.92 - 4.77) times higher in 
cancer patients versus controls. After stratification upon family history of cancer, this odds 
increase in the group with positive family history to 2.70 (95% CI 0.85 - 8.32) (Table 6). In 
addition to the G446A (Trpl49Stop), several other variants in the ARTSl gene were identified 
including a G490A (Glul64Lys) substitution in a thyroid adenoma (Table 1). Four mutations in 
a total of 64 thyroid adenomas and carcinomas analyzed were found (two C65T missenses, one 
G446A nonsense and one G490A missense). All four mutations were found in adenomas of 
follicular origin, whereas all samples of non-follicularhystotype (42/65, 65%) were wild-type. It 
is highly unlikely that this allelic distribution is random (P=0.01 at Fisher exact test). Also, a 
G446A homozygous patient in a family with CLL has thyroid adenoma (Table 2). Taken 
together, these observations raise the possibility that this ARTSl is involved in a portion of 
thyroid tumors with follicular histotype. 

[00701 ARTSl appears to be the first ARF family member reported to be altered in 
human cancers. Because of their nuclear localization signal (NLS), ARL4, ARL6 and ARL7 
appear to be cargo molecules transported via ttie translocators importin-i and a in the nucleus 
where they have yet unknown fimctions. Of note, ARTSl lacks a classical NLS at its C- 
tenninus, and probably contains an atypical NLS. Using GFP constructs, the wild-type ARTSl 
protein was shown to be localized both in the nucleus and in the c5^oplasm. The mutant ARTSl 
AC-terminus protein has the same iatracellular protein (Figure 6). ARTSl may be involved in 
novel cytoplasmic/nuclear membrane trafScking and/or signaling cascades that are important in 
different types of cells. 

[0071] Northern and RT-PCR expression data showed that ARTS 1 expression was 
dramatically decreased in A549, a highly tumorigenic non-small cell lung carcinoma (NSCLC) 
cell line(Fogh, J., Fogh, J.M. & Orfeo,T. One hundred and twenty-seven cultured human tumor 
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cell lines producing tumors in nude mice. J Natl Cancer Inst 59, 221-6 (1977) which is 
incorporated herein by reference) when compared to the level foxmd in normal lung. The ARTSl 
ORF under the control of the LTR promoter was transfected into A549. Several stable clones 
were obtained and five of them were used in experiments: parental A549, the A549-pMV-7 
(empty vector) clone, and three neomycin-resistant transfectants, selected according to the level 
of expression of the transfected ARTSl minigene (Figure 3). To evaluate the biological effect of 
ARTSl in vitro and in vivo, tumorigenicity was examined by soft agar and in Nu/Nu nude mice 
(Figure 3). All three transfected clones give rise to smaller colonies with a shorter survival in 
comparison with the parental cell or cells transfected with the empty vector. Furtheraiore, during 
10 weeks of observation after the s.c. injection, the former consistently formed smaller, 
nonprogressive tumors, while the latter formed large, progressively growing tumors in nude 
mice. Thus, ARTSl by itself has significant tumor-suppressor activity in A549 cells. 

[0072] The presence of a new tumor suppressor within the well-characterized 
superfamily of Ras oncogenes is not as contradictory. It was recently shown that wild type 
Kras2 could inhibit lung carcinogenesis in mice, clearly illustrating the tumor suppressor role of 
the gene m lung tumorigenesis (Zhang, Z. et al. Wildtype Rras2 can mhibit lung carcinogenesis 
in mice. Nat genet 29. 25-33. (2001) which is incorporated herein by reference). The principal 
mechanism for ARTSl inactivation in human cancers is biallelic methylation, as was proposed in 
the revised Knudson's two hit hypotheses (Jones, P.A. & Laird, P.W. Cancer epigenetices comes 
of age. Nat Genet 21, 163-7. (1999) which is incorporated herein by reference). One intriguing 
aspect of ARTSl involvement in human cancer is the real significance of the G446A 
(Trpl49Stop) nonsense mutation. Because the frequency of G446A mutation is about three 
times higher in fanulial cancers as in the general population and about two times higher as in 
sporadic cancers, one possible explanation is that ARTSl gennline mutations have low 
penetrance and are associated with a small percentage of familial melanoma or familial CLL 
cancers (which harbor a ten times higher frequency of the truncating mutation as in the same 
population control group). According to this, it is possible that there exists kindreds which carry 
the mutation but did not develop cancer. The same is tme also for some other TSGs as is the 
case of BRCA2 germline mutations in breast and pancreatic cancers (Goggins, M. et al. 
Gennline BRCA2 gene mutations in patients with apparently sporadic pancreatic carcinomas. 
Cancer Res 56, 5360-4. (1996) which is incorporated herem by reference). An alternative 
explanation is that this truncating mutation does not have a pathograetic role m human cancers, 
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because the lost domains are not important for tumorigenesis or because the protein has 
redundant functions with other ARL family members. Until now, only one polymorphic stop 
codon was identified in cancer related genes, the Lys3326ter in BRCA2 gene (Mazoyer, S. et al. 
A polymorphic stop condon in BRCA2. Nat Genet 14, 253-4. (1996) which is incorporated 
herein by reference). However, xmtil independent groups analyze a larger number of cases, the 
possibility that such polymorphisms are associated with a modest increased cancer risk or are 
associated with other phenotypes in the hetero2ygous or homozygous state cannot be excluded. 
Methods 

[0073] Cell Lines. Eighty cell lines derived firom human tumors were used in this 
study. Forty-four were hematopietic cancer cell lines and 36 were solid tumors cell lines (for 
detailed list, see Table 3). As controls, six lymphoblastoid cell lines made fi'om peripheral blood 
lymphocj^es of patients with Alzheimer's disease by transformation with Epstein Barr Virus 
(EBV) were used. All the cell lines were obtained from the American Type Culture Collection 
(ATCC) (Manassas, VA) and maintained according to ATCC instructions. 

[0074] Patient Samples. Experimental samples were derived from sporadic tumors or 
from peripheral blood of patients with familial cancer (total of 325). Control samples were 
derived from blood of patients with diseases other than cancar or from healthy individuals (total 
of 475). All samples were obtained with informed consent following institutional guidelines for 
the protection of human subjects. The 216 human sporadic tumors analyzed include 65 thyroid 
tumors, 58 colorectal adenocarcinomas, 48 breast carcinomas, 39 B-CLLs, 5 lung carcinomas 
and 1 idiopathyc pancytopenia. The panels of DNA from blood include: a) 69 DNA samples 
from females with BRCA-1 - and BRCA-2-negative familial breast cancer; b) 17 DNA samples 
from males afiTected with prostate cancer and malignant melanoma which had been found 
negative for mutations at the pl6 locus; c) 17 DNAs from patients with familial CLL (at least 
two first-degree relatives affected) and d) 6 DNAs from individuals with pancreatic cancer or 
melanoma who have a family history of at least one case of melanoma or pancreatic cancer and 
negative for mutations in the pl6 and pl4 genes. Patients' profile was similar for both groups: 
about 60% of cancer patients were from European Caucasian origin and the remaining 40% were 
from US persons. In the control group the proportions of the two cohorts were 75% and 25%, 
respectively. No bias toward distinct population groups (such as Ashkenazim) was noted. High 
molecular weight (HMW) DNA was extracted by conventional protocols (Sambrook, J., Frisch, 
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E.F. & Mardatis, T. Molecular cloning: A Laboratory Manual, (Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989) .which is incorporated herein by reference). 

[0075] Rapid Amplification of cDNA Ends (RACE). The 3 ' and 5 ' ends of mRNAs 
were obtained by RACE ifrom human testis, fetal liver, bone marrow and lymph node, using the 
Marathon-ready and the SMART RACE protocols (Clontech, Palo Alto, CA). The PGR 
products were separated on 1.0-2,0% agarose gels and gel purified using the QIAquick gel 
extraction kit (QL^GEN) or cloned in TA vector using TOPO TA Cloning (Invitrogen Carlsbad, 
CA) and sequenced. 

[0076] Northern blot analysis. Human multiple tissues Northern blots were purchased 
from Clontech and total RNA was extracted from tumor cell lines or tumors by the QIAGEN 
RNeasy mini kit (QIAGEN) according to the manufacturer's protocol. The membranes were 
hybridized with a 443-bp probe containing the majority of the ARTSl open reading frame (ORF) 
labeled with ^^P dCTP by random priming (Prime-it 11 Kit, Stratagene). Prehybridization and 
hybridization were carried out in Church Buffer (7% SDS, 0.5M phosphate buffer pH 7.2, 10 
mM EDTA) for 18-20 h at 65^C as described in Sambrook SUPRA. 

[0077] Reversed Transcription PCR (RT-PCR) analysis. The DNA sequence was 
confirmed by RT-PCR and a semiquantitative RT-PCR was performed to analyze tiie levels of 
gene expression in differmt normal and tumor tissues. Five microliters of cDNA were used for 
each PCR with Advantage2 PCR kit (Clontech) and 10 pmol of each gene-specific primer for 35 
cycles of 94°C for 20 s, 65^C for 30 s, 68°C for 1 min (for a complete list of primers used in this 
study, see Table 4). To ensure that the RNA was of sufficimt purity for RT-PCR, a PCR assay 
with primers specific for GAPDH cDNA (Clontech) was used. Semiquantitative PCR was 
perforaied with 23 cycles of amplification for ARTSl gene and 18 cycles for GAPDH, in order 
to remain within a range of linear increase in the amount of PCR product. RT-PCR products 
were separated by agarose gel electrophoresis and blotted on Hybond N + nylon membranes 
following standard procedures in Sambrook SUPRA. Membranes were hybridized with the 
same probe and in the same conditions as for Northern blotting. The relative intensity of 
hybridization signals was analyzed with a Phospholmager system (Molecular Dynamics). 

[0078] Methylation analysis by Southern blotting. In order to identify the global level 
of methylation for the ARTSl locus, five micrograms of total genomic DNA were digested with 
Bgin alone or in combination with methylation-sensitive Hpall (Roche) using a total of 40 U of 
enzyme for 12 h. Digests were electrophoresed on 0.8% agarose gels and blotted on Hybond N+ 
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positively charged nylon membranes (Amersham Pharmacia Biotech) and hybridized with the 
same ORF probe as described before. 

[0079] Methylation-specific PGR. To analyze methylation levels in the 5' upstream 
region of ARTS 1, a region upstream of the first exon on ARTSl was amplified and bisulfite 
sequencing was carried out as described in Frommer, M. et al. A genomic sequencing protocol 
that yields and positive display of 5-methylcytosine residues in individual DNA strands. Proc 
natl Acad Sci U S A 89, 1827-31. (1992), which is incorporated herein by reference. Modified 
DNA (200ng) was subjected to PGR. PGR products were purified and directly sequenced in 
order to obtain average methylation levels. In addition, PGR products were subcloned and at 
least six clones were sequenced to confirm direct sequencing data. Because of the unavoidable 
contamination of normal cells in the tmnor specimens, we defined a GpG site as 
"hypermethylated" when more than 70% of PGR products contained bisulfite-resistant cytosines 
"Partial methylation" indicates detection of these products in 20-70% of the total products. 

[0080] LOH studies. The paired normal and colorectal tumor DNA samples were 
tested for LOH by PGR amplification with oligonucleotide primers for microsatellite markers at 
D13S165 andD13S273 using fluorescent-labeled primers (ABI). One single nucleotide 
polymorphism found inside the ORF of ARTSl (T442G) was heterozygous in about 45% of 
sequenced samples and was very usefiil for the rapid discrimination of 
informative/noninformative patients. The amplification products were run on an Applied 
Biosystems Model 377 DNA sequencing system (PE, AppUed Biosystems). The LOH data for 
39 paired nomial/tumor B-GLL samples used in this study were previously reported in BuUrich, 
F. et al. Minimal region of loss at 3ql4 in B-cell chronic lymphocytic leukemia. Blood 88, 
3109-15 (1996), which is incorporated herein by reference. 

[0081] Mutation detection. Primers used in mutation analysis were designed fi-om 
intronic sequences directly upstream of the second exon and within the 3' UTR region of 
ARTSl. PGRs were carried out for 35 cycles of 94°G for 30 s, 62'*G for 30 s and 72°G for 1 min 
using RedTaq genomic DNA polymerase (Sigma- Aldrich, St. Loms, MO), purified with the 
QIAquick PGR purification kit (QIAGEN) and then both strands were directly sequenced using 
the Applied Biosystems Model 377 DNA sequencing system (PE, AppUed Biosystems, Foster 
Gity, GA). The 203 normal controls firom the Italian population were analyzed by denatured 
high-performance Uquid chromatography (DHPLG) (Transgenomics, Omaha NE). The 
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temperature used for heteroduplex formation was 57*^C and all the samples with abnormal 
patterns were directly sequenced. 

[0082] Subcellular localization. The pEGFP Nl - ARTSl vector was prepared by 
digesting pBGFP Nl (Clontech) with Smal; the insert was obtained by amplifying the ARTSl 
full-length insert with Pfu where its stop codon was ehmmated in order to generate an ARTSl- 
EGFP protein fused at the C-terminus. An additional pEGFP Nl-ARTSl AC-terminus vector 
was prepared carrying the ARTSl protein truncated at position 446 of the ORF where the stop 
mutation is located. 293 cells were transfected by calcium phosphate (ProFection from Promega, 
Madison WI) and cultured on a cover slip and 24-48 h after transfection cells were analyzed by 
fluorescence microscopy as described in Ghosh, K. & Ghosh, H.P. Role of the membrane 
anchoring and cytoplasmic domains in intracellular transport and localization of viral 
glycoprotiens. Biochem Cell Biol 77, 165-78 (1999), which is incorporated herein by reference. 

[0083] Stable transfection of A549 cells. A549 cell line was cultured in RPMI 
supplemented with 10% fetal bovine serum for the following studies. ARTSl expression vector 
p-MV7-ARLTSl-sense was constructed by ligating the ARTSl open reading frame in sense 
orientation into a mammaUan expression vector pMV-7. All constructs were sequenced in order 
to ©cclude random mutants and were transfected by FuGENE6 transfection reagent according to 
the protocol (Boehringer Mannhiem). Transfected cells were selected with G418, 

[0084] Analysis of transformed phenotype. Sofi-agar colony assay of A549 wild - type 
and ARTSl stable transfectants were performed as described in Trapasso, F. et al. Rat protein 
tyrosine phosphatase eta suppresses the neoplastic phenotype of retrovirally transformed thyroid 
cells through the stabilization of p27 (Kipl), Mol Cell Biol 20, 9236-46.(2000) which is 
incorporated herein by reference. A suspension of 106 cells in PBS (0.2ml) was injected 
subcutaneousy into the right flank of Nu/Nu athymic mice (Jackson Laboratories Charles River, 
Cambridge, MA). Mice were sacrificed after 1,3,5, and 8 weeks and tumors were removed, 
weighed and measiued in three dimesions. All experiments were perfomed in accordance with 
iastitutional guidelines. 

[0085] Statistical analysis. Statistical analysis of results was performed using the 
Fisher's exact test; a P value of <0.05 was considered statistically significant. The cancer risk 
associated witii the specific mutations identified in this study was anayzed using the odds ration 
(OR). 
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Table 1 - ARTSl sequence analysis in human cell lines, tumors and nomial controls. 



Variant 
name^ 


Amino 
Change 


Amino acid 

f»nn QprvfiH cm 


Cell Lines 


Sporadic 


Familial 
Cancers, 
blood (%) 


Normals, 
blood 

(%) 


T50 to C 


Met 17 
toThr 


9/14 (65) 


2/80 
(2.5)^ 


0/216 


0/109 


0/272 


C65toT 


Ser 22 
to Leu 


3/14 (21), PMl 
site 


0/80 


2/216(1)'* 


0/109 


1/272 (0.4) 


^zoz to 
A 


T on OC 

to Met 


Leu only in 
ARTSI 


1 /OA /I \'5 

1/oU (1) 


0/216 


0/109 


0/272 


C392 to 

I 


Pro 
i^i to 
Leu 


4/14 (29) 


6/80 
(7.5) 


14/216 
(6.5) 


4/109 (4) 


17/272 
(6.25) 


T442 to 

c 


Cys 
14R to 
Arg 


Cys only in 
arts: J 


25/80 


127/216 


80/109 


182/272 
(07) 


G446 to 
A 


Trp 
149 to 
Stop 


12/14 (86) 


2/80 
(2.5) 


8/216 
(3.7) 


6/109 (6) 


10/475 (2.1) 


G490to 
A 


Glu 
164 to 
Lys 


9/14 (65) 


0/80 


1/216 
(O.S)'^ 


0/109 


0/272 



Note: 1 - We identified also several synonymous polymorphisms such as: CI 75 to T 
(Leu 59); G 297 to A (Ser 99); C345 to T (Val 1 15); G396 to C (Leu 132); G546 to A (Gin 182). 

2 - Data obtained by a multiple alignment of ARTS 1 protein with ARFl to ARF6 and 
ARLl to ARL7 at the GenomeNet CLUSTALW server. 

3 - Foimd only in melanoma cell lines. 

4 - Found only iQ thyroid adenomas. 

Table 2 - Clinical data from families with G446A (Trpl49Stop) mutation. 



-25- 



wo 2004/033659 



PCT/US2003/032270 



Proband, sex, age* 


Cancer type 


Cancer Family history 


KRR0003, female, 46 


B-CLL 


Twin sister G446A +ve with B-CLL 


TOR-IB, male, 57 


B-CLL and lung 
cancer 


Sister, 53, homozygous G446A with Thyroid 
adenoma; his son, 30, obUgate carrier. 
Essential thrombocytemia 
Brother, heterozygous G446A - normal 
Mother, dead, obligate carrier, B-CLL at 
80 yrs-old 

Father, 86, obligate carrier, B-CLL 


P/M 35003, male, ? 


• Gastric, 72 
Melanoma, 72 
Prostate, 73 


None 


P/M 35012, male,? 


Prostate, 66 
Melanoma, 67 


Mother, cancer, unknown location, ? 
Brother, prostate, 73 
Sister, ^T^lack moles", ? 
Daughter, breast, ? 


1054-22671, male, 
dead 


Melanoma, 50 
Lmig metastasis, 55 


Paternal uncle, melanoma, ? 
Paternal aunt, pancreatic, ? 
Paternal cousin, pancreatic, ? 
Paternal cousin, head and neck, ? 


15-265-S87, female, ? 


Bilateral breast 
cancer, 32 and 35 
Ovarian cancer, 50 


Daughter, 48, G446A carrier, imaffected 



Note: 1 - ? == Age data imknown 



Table 3 - Cell lines used in the described experiments 
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Hystotype 


Cell lines used 


Burkitt's lymphoma 


AG876, AS283, BL2, BL30, BL41, CA46, DA978, 
uauai, xijD-15, lijjoo, jiyoye, JLaucjces, JNaaalwa, 
P3HR-1, Raji, Ramos, RSI 1864, SKDHL and WMN 


IVJLlUUpXC IHyClKJllXa 


riUJNoi, JVlwl^AK, JNUl-±iy2y, KJrMlo22o and 
U266B1 


Large cell lymphoma 


DB and SR 


Lmnunoblastic B cell lymphoma 


JMl 


Diffuse mix lymphoma 


HT 


Hodgkin's disease 


RPM16666andHs445 


Non-Hodgjdn's disease 


RL 


B-ALL 


MV4;11,RS4;1 1,697 


1 -well iyxiipiiiJixx<ib oiiu. leiUkenuaS 


i^JtiJVl, JJei 1, HJrl, J1dB2, JdLul 102, MOLT-3, MOLT- 
4, andMJ 


Haiiy cell leukemia 


MoT 


CML-Eiythroid letikemia 


K562 




AD4y, AJhL, caiu-3, Jlo9, H4ov), H1299, SKMES, 498 
and 1285 




ASiV^l, BXf^^o, (^apan-z, t^rJrAC-l, xISTooT, 
MiaPaca, PANCl, PSNl, and SU8686 


Esophageal cancers 


TBI, TB2, ElO, TE15, KY200 and KY300 


A/Tfili onsml* TYiplfmrtinfi 


iVll^, lUU/ Mr, jX o, WJVL 200.4, 397 MEL, 13443 
and four cell lines derived from melanoma patients 
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Colon carcinoma 


LoVo 


Cervical carcinoma 


HeLa 



Table 4 - Primers used in the described experiments 



Primername 


Primer sequence (5 '-3') 


Application 


3'-ex2F 


5' - CCA TGG GTT CTG TGA ATT 
CCA GAG G (SEQ ID NO:5) 


Northern, blot analysis 


5'-ex2R2 


5' - CAG TGG TCC TGG AAT CTC 
TCT AGA C (SEQ ID NO:6) 


3'exlF 


5' - GCC AGC AGA AAG CAG CTC 
CAT AGG (SEQ ID NO:7) 


Reversed Transcription PCT 
(RT-PCR) analysis 


5'ex2Rl 


5' - TTC AGG AGG CTC CAC AGG 
CTC TGC (SEQ ID NO:8) 


MET-F 


5 ' - GAG GTA TGT ATT GAA AG 
AAG AGG (SEQ ID NO:9) 


Mefliylation-specific PCR 


MET-R 


5' - AAC AAA ACC CAA TAA CAA 
CTC CA (SEQ ID NO:10) 


ORF-Fl 


5' - CAG AAG ACA GTA GCT GAT 
GTG (SEQ ID NO: 11) 


Genomic Mutation detection 


ORF-R2 


5' - GAG CAA AGA TAT GCT GCT 
CTG (SEQ ID NO: 12) 


Mael-Fl 


5' - GCT GAG TCC AGA GAG ATT 
CCA GG (SEQ ID NO: 13) 


G446A(Trpl49Stop) 
detection by Mael digestion 
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Mael-Rl 


5* - TCT CGC CTG CAG ACA CAT 






GC(SEQIDNO:14) 





Table 5 - Expression levels and methylation status of the ARLTSI promotGr in human 
cancer cell lines 



Name 


Origin 


ARTSl expression* 


Methylation 


Normal lung 1 


Normal lung 


+ 


Low 


Normal lung 2 


Normal limg 


+ 


Low 


A 549 


Lung carcinoma 


□* 


Hypermethylation 


AFL 


Lung carcinoma 


□/+ 


Hypennethylation 


Calu-3 


Lung carcinoma 




ND 


H1299 


Lung carcinoma 


P 


H3T)ennethylation 


H69 


Lung carcinoma 


□/+ 


ND 


1285 


Lung carcinoma 


0/+ 


ND 


H460 


Lung carcinoma 




Hypermethylation 


Lymphoblastoid 1 


Immortalized lymphoblasts 


+ 


Low 


Lymphoblastoid 2 


Immortalized lymphoblasts 


+ 


Low 


Dell 


T cell lymphoma 


□/+ 


Hypennethylation 


HH 


T cell lymphoma 


□ 


Hypermethylation 


HSB2 


T cell ALL 


□ 


Hypennethylation 


HuT 102 


T cell lymphoma 




Hypennethylation 


K562 


CML-Exytbroid leukemia 


□ 


ND 




T cell lymphoma 


□ 


Hypermethylation 
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JVLO 1 


T cell lymphoma 


□ 


Hypermethylation 


Ab 283 


Burldtt's lymphoma 


+ 


Low 


DLs 41 


Burkitt's lymphoma 




Low 


FoN 1 


Pancreatic carcinoma 


;□/+ 


Hypermethylation 




XT oiiurcauc caTcmoma 




Hypermethylation 


HeLa 


Cervical carcinoma 


□; 


Hypermethylation 


SW 480 


Colon carcinoma 


D 


ND 


G-361 


Melanoma 


Q. 


ND 



a : +, normal expression; +/-, reduced expression and absent expression; ND - not done 



Table 6 - Allele frequency of G446A (Trpl49$top) in unrelated cancer patients 
and control cases. 



Cancer patients 


Normal controls 


Tumor type 


Source 


Sample 

size, 

origin 


G446 
A 


Source 


Sample 

size, 

origin 


G446 
A 


Colorectal cancers 
"sporadic" 


Bucharest, 
Romania 


58, 

tumor 


2 


Philadelp 
hia 


116, 
blood 


1 


Breast "sporadic" 


Ferrara, 
Italy 


38, 

tumor 


3 


Bucharest 


156, 
blood 


2 


Breast "sporadic" 


Aarhus, 
Denmark 


10, 

tumor 


0 


Ferrara 


203, 
blood 


7 


CLL "sporadic" 


US 


39, 

tumor 


0 








Lung "sporadic" 


Milan, 


5, tumor 


1 
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Italy 












inyroia sporaaic 


Catanzaro, 
Italy 


65, 

tumor 


1 

1 








CLL familial 


Paris, 
France 


11. 

blood 


1 








CLL familial 


US 


6, blood 


1 








Breat familial 


Philadelphi 
a, PA 


69, 
blood 


1 








Melanoma + 
prostate 


Philadelphi 
a, PA 


17, 

blood 


2 








Pancreatic + 
melanoma 


Philadelphi 
a, 

PA 


6, blood 


1 








Idiopathyc 
Pancytopenia 


Bucharest, 
Romania 


1, blood 


1 








Total 




325 


14 

(4.30 
%) 




475 


10 

(2.10 
%) 
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WHAT IS CLAIMED IS: 

1 . An isolated protein comprising the amino acid sequence of SEQ ID NO:2. 

2. An isolated nucleic acid molecule that comprises a nucleic acid sequence that 
encodes the protein of claim 1. 

3. An isolated nucleic acid molecule comprising SEQ ID NO:l or a fragment thereof 
having at least 10 nucleotides. 

4. The nucleic acid molecule of claim 3 consisting of SEQ ID NO:l. 

5. A recombinant expression vector comprising the nucleic acid molecule of cldm 3, 

6. A host cell comprising the recombinant expression vector of claim 5. 

7. The nucleic acid molecule of claim 3 consisting of a fragment of SEQ ID NO: 1 
having at least 10 nucleotides. 

8. The nucleic acid molecule of claim 3 consisting of a fragment of SEQ ID NO: 1 
having 12-150 nucleotides. 

9. The nucleic acid molecule of claim 3 consisting of a fragment of SEQ ID NO: 1 
having 15-50 nucleotides. 

10. An oligonucleotide molecule comprising a nucleotide sequence complraientary to 
a nucleotide sequence of at least 5 nucleotides of SEQ ID NO:l. 

1 1 - The oligonucleotide molecule of claim 1 0 wherein said oligonucleotide molecule 
comprises a nucleotide sequence complementary to a nucleotide sequence of 5-50 nucleotides of 
SEQIDNOrl. 
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12. The oligonucleotide molecule of claim 10 wherein said oligonucleotide molecule 
comprises a nucleotide sequence complementary to a nucleotide sequence of 10-40 nucleotides 
ofSEQIDNO:!. 

1 3 . The oligonucleotide molecule of claim 1 0 consisting of a nucleotide sequence 
complementary to a nucleotide sequence of at least 10-150 nucleotides of SEQ ID NO:L 

14. The oligonucleotide molecule of claim 10 consisting of a nucleotide sequence 
complementary to a nucleotide sequence of at least 18-28 nucleotides of SEQ ID NO:l. 

15. An isolated antibody which binds to an epitope on SEQ ID NO:2. 

16. The antibody of claim 15 wherein said antibody is a monoclonal antibody. 

17. A method of identifying modulators of Caspase- 1 protein protease activity 
comprising the steps of: 

perfonning a test assay by contacting a Caspase-1 protease protein with a Caspase-1 

substrate in the presence of a test compound, 

determining the level of processing of said substrate by said protease, and 
comparing said level to the level of processing of a Caspase-1 substrate by Caspase-1 

protease protein in the absence of said test compound. 

18. The method of claim 17 wherein said protein has SEQ ID NO:2. 

19. The method of claim 17 wherein said substrate is FKBP46 protein. 
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pALTSl-gft> 



pAC-ALTSl-gfp 
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7/7 

°^^SS^S^^^°^^°^^^^A°CCACCTGGGAAAAGGAAGCACAAAAAGGAGAAGCACrCAAC 
GGCTGCTClXKnrCCTTCXXJATCnXJGCTCTTGGGTCATGCCTGGCCAGCAGAAAGCAGCTC^^ 
^^^-SSS^^^^'^^^'^^^^^^^°<=^°TCTAATAGTAACAC^^ 
CCAT^TTCTGTGAATTCCAGAGGTCACAAGGCGGAAGCCC^^ 

°OS22?F^°^^^^^°^^<^^C^GCTGAAGGGCCACCAGCTGGTGGAGACCCTGCC^^ 

GGTITCAACGTGGAGCCTCTGAAAGCTCCTGGGCACGTGTCACTGACTCTCTGGGACGTTGGGGC^^ 

AGGCCCCGCTCAGAGCCAGCTGGAAGGACTATCTGGAAGGCACAGATATCCrCGTGTACGTGCTCGA 

CAGCACAGATGAAGCCCGCTTACCCGAGTCGGCGGCTGAGCTCACAGAAGTCCKJAACOACCCC^^ 

CATGGCTGGCGTCCCCTTCTTGGTGCTGGCCAACAAGCAGGAGGCACCTXSATC^ 

AGATCAGAAACAGGCTGAGTCTAGAGAGATrCCAGGACCACTGCTGOGAGCrcCGGGGCKJCAGTC 

CCCTCACTGGGGAGGGGCTGCCCGAGGCanXK:AGAGCCTXmK}AGCCnx:CTOAAATC^^^^ 

CATGTGTClXK:AGGCGAGAGCCCATGQGGCroAGa}CXSGAGACAGCAAGAGATCrrGATCC>^C^ 

TATCAAACAAGAAATGCTGGCrTGGCCCGGTGGCTCATGCCTGTAATCCCAGCACTGTGGGAGACCA 
CGGTGGGQGAATCCCTrGAGCCCAGGAGTTGGAGAGCAACATCACAACACCCCATTTCTACTAATAA 

o^TS?^^'^°SF^'^°^°°'^^°^°°"^^°'^°A<3CCAAGATCGCGCCACrGCACT^ 
GCAACAGAGTGAGACCCTGTCTCAATAATAATAATAATAATAATGATGATACTCTAAGAAAAAAATC 
TCAACATACTTCAnTAATAGCTCGTTACCAAGTGTGAATGAAGCAATATGTCATAATAGAGTAGCC 
^S!?2F°*^'^'^'^'^'^'^^°^°^*^'^^TTCTCAAATAGGGAAAGAGGT^ 

GCCAGGTGCAGTGGCTCATGGGCGGAGGAGGGCAGATTACrrGAGGCTAGGAGTTCAAGACCAGCC 

TSSSS^^:^°°'^°^'=CCCATCTCTACTGAAAATACAAAAATTAGGCATAGTXKiTOC^^^ 

^™^^^^'^^*^^°°^°°"°^°°CA°^GAATCGCTrGAACCX:AGGAAGTCGAGGTTGCAGTG 

AGCCGAGATTGTGCTGCTGCACTCCAOCCTCGQTCAAAAAGACAGGCTGTGTCTCCAAAAAG^^ 

AAAAAAGTCAAATTCAAATATCATCTGGACATGTCACAATOGATCGCGGATCClTATGAGTGATrTr 

CCCCAGTGGC<XCTXK3GGATGTGCCACroTCACTCAGAAGGGCAAGCTAGGCAGGGCCCAT(XA^^ 

CACACCATCCAAAATTTATCCTTG i'l ll"ll-14'CTGATGCTAATTAGCCTCTCCCGArnTATGACATCT 
TCTCTTOATCTTTITCAAAAACTCAT^ 

T?^^2^'^^^°^'^^^'^^^'^'^""A™'IT'CATTAATIT 

^^^^I^^G;^TGACAGTGACGAATGCCTTCTGTCTCATGATTGACAGGGAAAAGGAAGGT^GA^^^ 

AGCATCCTAGAAGGCTCATCAGGTGATCATTACCTAGCATCCATGAAGCACCTGAAATTATITGCAA 

AATGTTACGCrTTGGACCATTrTrCCGGGGAAGGAGATCCAGAACTTITrACCAGAT^ 

^?;^?J?^'^^^^'^^°'"^^<=^^'r^^^°ATGTGGTTGTTGTATCCCrCATC^ 

TTCTGTAATCTGAGTmTrAATGGCAAGTGGCCTATATTTAaCACCTOTrcrcATGTTAA^^ 

GAATGTTAGATATTCTTTCTTATCCTGGACTGGTrcrcTCTATCrCTGOAGTAATGCAGTA^^ 

^S^TS^JA^CCTCCTAAAATCTGAGATCTGC^^ 

GAATGAGATAAGGCCTCAGGCCTCATGGAAGGTGAGACAGTAAAGACATTACTCCCATAAAAATGT 

GGTCITGGTGTriTACATAGACTGTCTCTAATTCTCACAACrCTGCAAAATATATA^^ 

TAAAACrACAAACTOAGGCTCAGAGAAGQTGTGACCTCTTGTTGCTTGAGGCACAGAGTTATAAAGT 

/^CATATCTGGAATTTGAAATGAGATCTGTTTAGGGCTAATGCTGCATTriTCTA 

CTAGAAGGTTTAAGCTAGGTAGGCTTTCAGCCAGCAGACATGATGGGGAGAGCCITCTAATAAGAGG 

GAAGAGACTGCTTGGAAGCATGAAGGGAGGTGTAAGAAAGATAAGTAAGTCAGTGTACTTGCAACA 

GAGGCTTGGGATGAAGGGTGGGTGAAGTTGACATCACGATAGAAAACAAAACTOGAATCGGAGTIT 

AGGTCCAATTTGGGCAAGGTrGTTTGAATTrCAATAATCAGGGGTTTGGGTCAAGGAAOAAAAATCA 

lS2°j^^^^^^'^^°°AGGATAATTITGTGGTAGTGTGGAGGTCA^ 

CTTGGAGCTGGGAAGGCAGGAAACCGGCTAGATGACCATCACACAGCAAAGGAGGGAGTCGAAGA 
^A?ATS^S5^tT?^^°'^°^^'^^'^^°^^^^CAGTT^^ 

A^S^J3Il^^°^^'^^^^'^°^'^^°^<^A^°°AATGCCATGAACTGGAGA^^ 

AACCAGGATGGGACGGGGCTGGAACAGCTGGGTTCAGCTTTTCCAGGGTGGGTACGTGTITGGTrAT 

TCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIG. 7 
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TJU2863.ST25.txt 
SEQUENCE LISTING 

<110> Croce, Carlo M. 

<120> NOVEL TUMOR SUPPRESSOR GENE AND COMPOSITIONS PND METHODS FOR 
MAKING AND USING THE SAME 

<130> 1^-2863 

<150> US 60/417^842 
<151> 2002-10-11 

<160> 14 

<170> Patentin version 3.2 

<210> 1 

<211> 3791 

<212> DNA 

<213> Homo sapiens 



<400> 1 
cacccaagtc 


tgagttgcta 


aaaaatggag 


ctgtcactgg 


gccttgctct 


gccaggacct 


60 


gcagagccgg 


ggacctctct 


gtggcaagcc 


cagcaagatg 


actgctctga 


ggcgccctag 


120 


ggctgaggga 


ggggccgtga 


caccagcccc 


gccccccagc 


cacctgggaa 


aaggaagcac 


180 


aaaaaggaga 


agcagcaacg 


gctgctctgc 


ttccttccca 


tctcgctctt 


gggtcatgcc 


240 


tggccagcag 


aaagcagctc 


cataggggag 


gagagccacg 


caggatctca 


cagctgcagt 


300 


ctaatagtaa 


cacagaggat 


tcagcagtgg 


ccaccatggg 


ttctgtgaat 


tccagaggtc 


360 


acaaggcgga 


agcccaggtg 


gtgatgatgg 


gcctggactc 


ggcgggcaag 


accacgctcc 


420 


tttacaagct 


gaagggccac 


cagctggtgg 


agaccctgcc 


cactgttggt 


ttcaacgtgg 


480 


agcctctgaa 


agctcctggg 


cacgtgtcac 


tgactctctg 


ggacgttggg 


gggcaggccc 


540 


cgctcagagc 


cagctggaag 


gactatctgg 


aaggcacaga 


tatcctcgtg 


tacgtgctgg 


600 


acagcacaga 


tgaagcccgc 


ttacccgagt 


cggcggctga 


gctcacagaa 


gtcctgaacg 


660 


accccaacat 


ggctggcgtc 


cccttcttgg 


tgctggccaa 


caagcaggag gcacctgatg 


720 


cacttccgct 


gcttaagatc 


agaaacaggc 


tgagtctaga 


gagattccag 


gaccactgct 


780 


gggagctccg 


gggctgcagt 


gccctcactg 


gggaggggct 


gcccgaggcc 


ctgcagagcc 


840 


tgtggagcct 


cctgaaatct 


cgcagctgca 


tgtgtctgca 


ggcgagagcc 


catggggctg 


900 


agcgcggaga 


cagcaagaga 


tcttgatcca 


gacagagcag 


catatctttg 


ctcatacaaa 


960 


ctagaagaac 


cagctgatcc 


ttgagaaatt 


tacgcttagt 


ctatcaaaca 


agaaatgctg 


1020 


gcttggcccg 


gtggctcatg 


cctgtaatcc 


cagcactgtg 


ggagaccacg gtgggggaat 


1080 


cccttgagcc 


caggagttgg 


agagcaacat 


cacaacaccc 


catttctact 


aataatcaaa 


1140 


aaattggccg 


ggcatggtgg 


catgtgcctg 


tagtcccagc 


tacttgggag 


gctgaggcag 


1200 


gagaatcgct 


tgagcccaag 


aggtagaggt 


tgcagtgagc 


caagatcgcg 


ccactgcact 


1260 


ccagtctggg 


caacagagtg 


agaccctgtc 


tcaataataa 


taataataat 


aatgatgata 


1320 


ctctaagaaa 


aaaatctcaa 


catacttcat 


ttaatagctc 


gttaccaagt 


gtgaatgaag 


1380 


caatatgtca 


taatagagta 


gccactggtt 


gcataataat 


agagacctaa 


attctcaaat 


1440 
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agggaaagag 


gttttaaaat 


caaatttgag 


gccaggtgca 


gtggctcatg 


ggcggaggag 


1500 


ggcagattac 


ttgaggctag 


gagttcaaga 


ccagcctggc 


caacatggtg 


aaaccccatc 


1560 


tctactgaaa 


atacaaaaat 


taggcatagt 


ggtgcacgcc 


tgcagtccca 


gctactcagg 


1620 


aggttgaggc 


agaagaatcg 


cttgaaccca 


ggaagtggag 


gttgcagtga 


gccgagattg 


1680 


tgctgctgca 


ctccagcctg 


ggtgaaaaag 


acaggctgtg 


tctccaaaaa 


gaaaaaaaaa 


1740 


agtcaaattc 


aaatatcatc 


tggacatgtc 


acaatggatc 


gcggatcctt 


atgagtgatt 


1800 


ttccccagtg 


gcccctgggg 


atgtgccact 


gtcactcaga 


agggcaagct 


aggcagggcc 


1860 


catccaacag 


caggggtctg 


caggttagac 


gttccctgcc 


ctgggacgct 


cacccctggg 


1920 


caagaggctg 


gaagtbcaca 


ccatccaaaa 


tttatccttg 


ttttttttct 


gatgctaatt 


1980 


agcctctccc 


gattttatga 


catcttgtgt 


tgatcttttt 


caaaaactca 


ttttcttttt 


2040 


tttccttctc 


ttttctcctt 


cttgtagcac 


atatctttcg 


ttaaagatca 


gatcaataaa 


2100 


atattttatt 


tattcattaa 


tttaacaaaa 


aaaacagagc 


atttagtttg 


tggcaaaaac 


2160 


actgagcttt 


cgaatatgaa 


tcatgtgctt 


taggtgggag 


ttgtgaattc 


tgaagataca 


2220 


gatgacagtg 


acgaatgcct 


tctgtctcat 


gattgacagg 


gaaaaggaag 


gttgaccata 


2280 


gcatcctaga 


aggctcatca 


ggtgatcatt 


acctagcatc 


catgaagcac 


ctgaaattat 


2340 


ttgcaaaatg 


ttacgctttg 


gaccattttt 


ccggggaagg 


agatccagaa 


ctttttacca 


2400 


gattttcaaa 


gacatctgtg 


actcccaaaa 


gttaacaatc 


actgatgtgg 


ttgttgtatc 


2460 


cctcatccaa 


ccccagaaca 


ctttctgtaa 


tctgagtttt 


ttaatggcaa 


gtggcctata 


2520 


tttagcacct 


gttctcatgt 


taaacagctc 


tgaatgttag 


atattctttc 


ttatcctgga 


2580 


ctggttctct 


ctatctctgg 


agtaatgcag 


tataaattgg 


ccatcagtac 


cctcctaaaa 


2640 


tctgagatct 


gccaggcccc 


tcttctaaca 


ccaggttagg 


catgcttggt 


tatttccagt 


2700 


acttgtgagt 


caacatgttt 


caagacgctg 


tgttagacac 


tagggatgca 


aagatgaatg 


2760 


agataaggcc 


tcaggcctca 


tggaaggtga 


gacagtaaag 


acattactcc 


cataaaaatg 


2820 


tgaggagaga 


gactcagttc 


agcaactgtt 


tattctgttt 


attgagcact 


tacttggacc 


2880 


aagcactgtg 


gtcttggtgt 


tttacataga 


ctgtctctaa 


ttctcacaac 


tctgcaaaat 


2940 


atatatattc 


ccattttata 


aaactacaaa 


ctgaggctca 


gagaaggtgt 


gacctcttgt 


3000 


tgcttgaggc 


acagagttat 


aaagtaacat 


atctggaatt 


tgaaatgaga 


tctgtttagg 


3060 


gctaatgctg 


catttttcta 


caacatcatg 


cctctagaag 


gtttaagcta 


ggtaggcttt 


3120 


cagccagcag 


acatgatggg 


gagagccttc 


taataagagg 


gaagagactg 


cttggaagca 


3180 


tgaagggagg 


tgtaagaaag 


ataagtaagt 


cagtgtactt 


gcaacagagg 


cttgggatga 


3240 


agggtgggtg 


aagttgacat 


cacgatagaa 


aacaaaactg 


gaatgggagt 


ttaggtccaa 


3300 


tttgggcaag 


gttgtttgaa 


tttcaataat 


caggggtttg 


ggtcaaggaa 


gaaaaatcat 


3360 


gggacttgcc 


atttaggagg 


ataattttgt 


ggtagtgtgg 


aggtgaaata 


aagagaaaag 


3420 


ggaaccttgg 


agctgggaag 


gcaggaaacc 


ggctagatga 


ccatcacaca 


gcaaaggagg 


3480 


gagtggaaga 


gagatgagaa 


aattgagagc 


tattattaag 


aaaaacagtt 


gagagaggaa 


3540 
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gaatttgaag 


agggctcaag 


attttgagtc 


cacatgacag 


aaggactgga 


atgccatgaa 


3600 


ctggagaagg 


tgagcgctga 


agaaccagga 


tgggacgggg 


ctggaacagc 


tgggttcagc 


3660 


ttttgcaggg 


tgggtacgtg 


tttggttata 


gctgctttca 


gattgttcca 


ttatctgtac 


3720 


tcccaacaac 


cctgccggat 


atatttgttg 


gctttcactc aaaaaaaaaa 


aaaaaaaaaa 


3780 


aaaaaaaaaa 


a 










3791 



<210> 2 

<211> 196 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Gly Ser Val Asn Ser Arg Gly His Lys Ala Glu Ala Gin Val Val 
15 10 15 



Met Met Gly Leu Asp Ser Ala Gly Lys Thr Thr Leu Leu Tyr Lys Leu 
20 25 30 

Lys Gly His Gin Leu Val Glu Thr Leu Pro Thr Val Gly Phe Asn Val 
35 40 45 



Glu Pro Leu Lys Ala Pro Gly His Val Ser Leu Thr Leu Trp Asp Val 
50 55 60 



Gly Gly Gin Ala Pro Leu Arg Ala Ser Trp Lys Asp Tyr Leu Glu Gly 
65 70 75 80 



Thr Asp lie Leu Val Tyr Val Leu Asp Ser Thr Asp Glu Ala Arg Leu 
85 90 95 



Pro Glu Ser Ala Ala Glu Leu Thr Glu Val Leu Asn Asp Pro Asn Met 
100 105 110 



Ala Gly Val Pro Phe Leu Val Leu Ala Asn Lys Gin Glu Ala Pro Asp 
115 120 125 



Ala Leu Pro Leu Leu Lys lie Arg Asn Arg Leu Ser Leu Glu Arg Phe 
130 135 140 



Gin Asp His Cys Trp Glu Leu Arg Gly Cys Ser Ala Leu Thr Gly Glu 
145 150 155 160 



Gly Leu Pro Glu Ala Leu Gin Ser Leu Trp Ser Leu Leu Lys Ser Arg 
165 170 175 



Ser Cys Met Cys Leu Gin Ala Arg Ala His Gly Ala Glu Arg Gly Asp 
180 185 190 



Ser Lys Arg Ser 
195 
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<210> 3 
<211> 24 
<212> RNA 
<213> Ribozyme 

<400> 3 

caaagcagga gcgccugagu ague 24 



<210> 4 

<211> 24 

<212> RNA 

<213> Ribozyme 

<400> 4 

caaagcagga gcgccugagu ague 24 



<210> 5 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 5 

ccatgggttc tgtgaattcc agagg 25 



<210> 6 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 6 

cagtggtcct ggaatctctc tagac 25 



<210> 7 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 7 

gccagcagaa agcagctcca tagg 24 



<210> 8 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 8 

ttcaggaggc tccacaggct ctgc 24 



<210> 9 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Oligonucleotide primer 



<400> 9 

gaggtatgta ttgaaagaag agg 



23 



<210> 10 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide prdLmer 

<400> 10 

aacaaaaccc aataacaact cca 23 



<210> 11 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 



<2a0> 12 

<211> 21 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide primer 

<400> 12 

gagcaaagat atgctgctct g 21 



<210> 13 

<211> 23 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide primer 



<210> 14 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 14 

tctcgcctgc agacacatgc 20 



<400> 11 

cagaagacag tagctgatgt g 



21 



<400> 13 

gctgagtcca gagagattcc agg 



23 
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